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Abstract — We consider a stylized stochastic model for a 
wireless CSMA network. Experimental results in prior studies 
indicate that the model provides remarkably accurate throughput 
estimates for IEEE 802.11 systems. In particular, the model offers 
an explanation for the severe spatial unfairness in throughputs 
observed in such networks with asymmetric interference condi- 
tions. Even in symmetric scenarios, however, it may take a long 
time for the activity process to move between dominant states, 
giving rise to potential starvation issues. 

In order to gain insight in the transient throughput character- 
istics and associated starvation effects, we examine in the present 
paper the behavior of the transition time between dominant 
activity states. We focus on partite interference graphs, and 
establish how the magnitude of the transition time scales with the 
activation rate and the sizes of the various network components. 
We also prove that in several cases the scaled transition time 
has an asymptotically exponential distribution as the activation 
rate grows large, and point out interesting connections with 
related exponentiality results for rare events and meta-stability 
phenomena in statistical physics. In addition, we investigate the 
convergence rate to equilibrium of the activity process in terms 
of mixing times. 

Index Terms — CSMA networks, throughput starvation, inter- 
ference graph, asymptotic exponentiality, mixing time, conduc- 
tance. 



I. Introduction and motivation 

We consider a stylized model for a network of nodes sharing 
a wireless medium according to a CSMA-type protocol. The 
network is described by an undirected graph (V, E) where the 
set of vertices V represents the various nodes of the network 
and the set of edges E C V x V indicates which pairs of 
nodes interfere. In other words, nodes that are neighbors in the 
interference graph are prevented from simultaneous activity, 
and thus the independent sets correspond to the feasible joint 
activity states. A node is said to be blocked whenever the 
node itself or any of its neighbors is active, and unblocked 
otherwise. Each node activates (starts a transmission) at an 
exponential rate v whenever it is unblocked. The transmis- 
sion durations of nodes are independent and exponentially 
distributed with unit mean. 

This research was financially supported by The Netherlands Organization 
for Scientific Research (NWO) in the framework of the TOP-GO program 
and by an ERC Starting Grant. 



Let fi* C {0, 1} V be the collection of incidence vectors 
of the independent sets of the interference graph (V, E), and 
let X% <E f2* denote the joint activity state at time t, with 
its i-th element indicating whether or not node i is active at 
time t. Then (Xj )t>o is a reversible Markov process JTJ with 
stationary distribution 



n x (v) = lim P{A7 = x} = 

t— >oo 



a: en*, (i) 



with ||x||i = J^iev x i th e num ber of active nodes in state x. 

The above model has a long history Q-Q. It was rediscov- 
ered in the context of IEEE 802.11 systems in J5|, and further 
developed in that setting in |6}, (7). While the modeling of the 
IEEE 802.11 back-off mechanism is less detailed than in the 
model of Bianchi [8|, the general interference graph offers 
greater versatility and covers a broad range of topologies. 
Experimental results in [9] demonstrate that the model, while 
idealized, provides remarkably accurate throughput estimates 
for actual IEEE 802.11 systems. It is also worth observing 
that the model amounts to a special instance of a loss network 
1 10 1, JTT], and that the stationary distribution corresponds 



to the Gibbs measure of the hard-core model in statistical 
physics JT2), (B). 

An activity state in J7* is called dominant if it corresponds to 
an independent set of maximum size max^gQ* \\x\\i- It follows 
from the stationary distribution ([1} that only the dominant 
states retain probability mass as v —> oo. This causes a severe 
problem, called spatial unfairness, when some nodes belong to 
fewer maximum-size independent sets than others as a result 
of asymmetric interference conditions p4) , and receive far 
lower or even zero throughputs. 

Even in symmetric scenarios where the long-term through- 
puts are equal, however, potential starvation issues can occur, 
since it may take a long time for the activity process to move 
between dominant states. Consider for example an interference 
graph where the nodes can be partitioned into two independent 
sets of maximum size. The activity process will spend roughly 
half of the time in each of the two associated dominant 
states as v — y oo. Since each of the nodes is active in 
one of the dominant states, they will all receive equal long- 
term throughputs, so spatial unfairness is not an issue in 



the long run. Yet, another source of severe unfairness arises 
as v — y oo, because it will take an extremely long time 
for the activity process to move from one dominant state to 
the other (resembling meta- stability phenomena in statistical 
physics fl3| , fl6)). As a result, each node will experience long 
sequences of transmissions in rapid succession, interspersed 
with extended periods of starvation. 

In order to gain insight in the above issue, we explore in 
the present paper the behavior of the Markov process (X t *) t >o 
in a scenario with a complete if -partite interference graph 
(V, E). By this we mean that the nodes can be partitioned into 
K disjoint sets called components, such that two nodes block 
each other, if and only if they belong to different components. 
The key contributions of the present paper may be summarized 
as follows: 

(i) We examine the asymptotic behavior (as v — > oo) of 
the transition time between various activity states, exploiting 
classical results for absorption times in birth-and-death pro- 



cesses p7|-| 19 1 and a representation of the transition time as 
a geometric random sum. 

(ii) We establish that the magnitude of the transition time 
scales as ®(v"~ x ), reflecting how 'rigid' the dominant activity 
pattern is as function of the activation rate v and the expo- 
nent /3, which is completely determined by the sizes of the 
various components. 

(iii) We prove that in several cases the scaled transition time 
has an asymptotically exponential distribution and compare it 
with related exponentiality results for rare events [20|-[25|. 

(iv) We investigate the rate of convergence of the activity 
process to the equilibrium distribution, and demonstrate that 
the mixing time is of the same order as the escape time of the 
second-largest component. 

The remainder of the paper is organized as follows. In Sec- 
tion [II] we present a model description and some preliminary 
results. In Section III we examine the asymptotic behavior of 
the transition time in the case of a bipartite graph. Section IV 
describes the extension of the results to arbitrary partite 
graphs. In Section [V] we investigate the rate of convergence to 
equilibrium in terms of mixing times. Section |Vl] concludes 
with some remarks and a review of further extensions. 

II. Model description and preliminary results 

Consider the Markov process (X 4 *) t > as described in the 
introduction with a complete if -partite interference graph 
(V, E). Thus the nodes in V can be partitioned into K disjoint 
sets called components, such that two nodes are connected 
by an edge in E, if and only if they belong to different 
components. In view of the symmetry, all the states with the 
same number of active nodes in a given component can be 
aggregated, and we only need to keep track of the number 
of active nodes, if any, and the component they belong to. 
This state aggregation yields an equivalent Markov process 
(X t )t>o on a star-shaped state space ft with K branches 
which emanate from a common root node and correspond 
to the components of the interference graph. Specifically, 
fl = {0}U{(£;, I) : 1 < I < L k , 1 < k < K}, where the center 



state indicates that none of the nodes is active and state (k, I) 
corresponds to the situation where I nodes are active in the k- 
th component, denoted by Ck, with L k denoting the size of Ck 
(see Figure 1). Notice that £1 can be alternative described as 
{0}U|JfeLi ^ fc ' T ne transition rates of the process (X t )t>o are 
given by q(0, (k, 1)) = L k v, q((k, I), (k, I + 1)) = {L k - l)v, 
l = l,...,L k -l, q((k, 1), 0) = 1, and q((k, I), (k,l-l)) = I, 
l = 2,...,L k ,k = l,...,K. 

C 2 




Fig. 1. Example of the state space Q with K = 4. 

The stationary distribution of the process (X t ) t > reads 

K L k 



**>(") = (i+ EE 

k=l 1=1 



, L k , k = 1, 



(2) 



Denote by 



T (ki,h),{k 2 ,h)( v ) = ini {t > : x t = (k 2 ,l 2 )\X Q = (kuh)} 

the first-passage time of state (ki, h) starting in (ki, h), which 
we also refer to as the transition time from state (fci, h) to state 
(k2,h)- m the next sections we will analyze the asymptotic 
behavior of the transition time as v — > oo. As discussed in 
the introduction, the transition time provides useful insight in 
transient throughput characteristics and starvation effects in 
wireless CSMA networks. 

In preparation for the asymptotic analysis of general scenar- 
ios, we first present a few results for the case where the two 
states (k\,l\) and (k2,h) belong to the same component, i.e. 
ki = k-2 and l\ > I2. The presence of the other components is 
not relevant then for the transition time, and hence we focus 
on the case of just a single component (K — 1), and drop 
the component index for now. When K = 1, the process 
(X t )t>o evolves as an elementary birth-and-death process on 
the state space {L, L — 1, . . . , 1, 0}, so that we can exploit 
several classical results for such processes. 

A. Asymptotic growth rate 

We first state how the expectation of the transition time 
Ti ly i 2 (v) scales as v — > 00. Here and throughout the paper we 
write f(y) ~ g(y) to indicate that lim^oo f{v)/g(v) = 1 as 
v — >• 00 for any two real-valued functions /(•) and g(-). 

Proposition 11.1. For L > l x > l 2 > 0, 



l 2 \{L~l 2 - 1)1 
L\ ' 



.L-U 



Proof: First observe that WF h .i 2 {v) = Y!i=i] ET m _i(», 
so we can exploit a general result for birth-and-death pro- 



cesses (T8[ , which in the present case says that, for < I < L, 

ET,,,_i(l/) = \ Y,n=l N0W <0 im P lieS that T»M = 

o{ttl(v)) as — > oo for all n — I, . . . , L — 1, so that 



ET, 



1,1-1 



M~7 



lTTiM _ (i-OKi-l)I,X-l 



OO. 



I 7Tj(l/) L! 

Thus ET;,;_i(z/) = o(ET, 2+ i, l2 ) as z/ -> oo for all / = ii, . . . , 
?2, and hence ET/ l! ; 2 (^) ~ ETi 2 +ij 2 (i/) as v — > oo. ■ 
In order to gain insight in starvation effects, we are partic- 
ularly interested in the time for the activity process to reach 
the center state 0, referred to as escape time, because at such 
points in time nodes in other components have an opportunity 
to activate. Proposition |II. 1 1 shows that 

ET lu0 (v) ~ \v L ~ x , u^oo. 



L 



(3) 



Hence, the escape time grows as a power of v, where the 
exponent corresponds to the component size minus one, and 
is asymptotically not influenced by the starting state l\. 

B. Asymptotic exponentiality 

Having obtained the asymptotic growth rate, we now turn 
attention to the distribution of the scaled escape time, and show 
that it has an asymptotically exponential distribution. We will 
leverage the following well-known result for birth-and-death 
processes, which is commonly attributed to Keilson (T9| or 
Karlin and McGregor (17} . 



Theorem II.2. Consider a birth-and-death process with gen- 
erator matrix Q on the state space {0, . . . , d} started at 
state d. Assume that is an absorbing state, and that the 
other birth rates and death rates {/ii}f =1 are positive. 

Then the absorption time in state is distributed as the sum 
of d independent exponential random variables whose rate 
parameters are the d nonzero eigenvalues of —Q. 

Let Q{v) be the generator matrix of the birth-and-death 
process {X t )t>o on the state space {L, L — 1, ... ,1,0}, with 
an absorbing state. Let < ot\{v) < OL2{v) < • • • < ai(i') 
denote the non-zero eigenvalues of —Q(v). The property 
that the eigenvalues are distinct, real and strictly 

positive, is well-known [26|, see also the proof of Lemma |II.3| 

we obtain that 



Applying Theorem II. 2 



L 

£ 



Yi{v), 



(4) 



where Y\ (v) , . . . , Yl (y) are independent exponentially dis- 
tributed random variables with EYi(v) = \joniv). 

Intuitively, the eigenvalue ct\(v) will become really small 
as v — > oo, and so the mean escape time ETl${v) will be 
dominated by EY±(u) = \/oti{v). Combining Q and Q 
suggests that 

Indeed, we can make this precise with the following result 
for the growth rates of the eigenvalues as v — > oo and their 
relation to the mean escape time 'KTj J fl{y). The proof of 



the result is presented in Appendix A, and exploits detailed 
information about the growth rates of the eigenvalues obtained 
via symmetrization and the Gershgorin circle theorem. 



Lemma II.3. 

lim oti(v) ■ ET Lt0 (v) 



1. 

00, 



1, 

2, 



,L. 



The above lemma shows that the smallest eigenvalue a.\(v) 
becomes dominant as v — > oo, but also proves the asymptotic 
exponentiality of the escape time. Indeed Q means that the 
Laplace transform of the scaled escape time is 

l 



£ r 1|0 M/ET L|0 (F)(s) = \\ (l + 



Oii(v) ■ ET Lfi (v) 



Lemma II. 3 then implies that 



lim Ct 



l i|0 (i/)/ET I ,,o 



m( s ) = i 



1 



The continuity theorem for Laplace transforms then yields 
that the scaled escape time has an asymptotically exponential 
distribution as stated in the next theorem, where Exp(A) 
denotes an exponentially distributed random variable with 
mean 1/A. 



Theorem II.4. 



AExp(l), 



ET Li0 H 

The above result may be interpreted as follows. For large v, 
state L is frequently visited, while state becomes rare. This 
suggests that the probability of hitting state before the first 
return to state L becomes small. So the time Tl$(v) consists 
of a geometric number of excursions from L which return 
to L without hitting 0, followed by part of the excursion that 
hits 0. Hence, apart from this final excursion, T^ofV) is the 
sum of a large geometric number of i.i.d. random variables, 
which should be asymptotically exponential. 

The fact that the time until the first occurrence of a rare 
event is asymptotically exponential, is a widely observed 
phenomenon in probability [25} . In order to establish expo- 
nentiality of hitting some subset B of the state space, one 
typically decomposes the process into regenerative cycles, and 
assumes that (i) the probability of hitting B in a single cycle 
is small, and (ii) the length of the cycle in which B is hit 
is asymptotically negligible compared with the mean cycle 
length (23), (25). For the case K = 1, both assumptions 



hold, and an alternative proof of Theorem II.4 can be obtained 



using 1 23 Thm.l] (which is a generalized version of the 



theorem proved in |24|). However, this general theorem for 
regenerative processes will fail for the majority of cases 
considered in this paper. For instance, in the case K = 2 and 
L\ = L 2 (two identical components), the process (X t ) t > 
will exhibit bistable behavior, in the sense that, as v — > oo, 
it spends extremely long periods circling around either one 
of the leaf nodes. For analyzing the transition time between 
the two dominant leaf nodes, assumption (i) remains valid, 
because it is still extremely hard to go from one leaf to the 
other, but assumption (ii) will be violated. Indeed, when the 
process crosses all the way from one leaf to the other, the 



length of the latter cycle will be of the same order as the entire 
hitting time. Despite the fact that the regenerative approach no 
longer works, we will prove (using different methods) in the 
next section that the scaled transition time between dominant 
states is still asymptotically exponential. 

Let us finally remark that for reversible Markov chains 
similar exponentiality results were established in pO)-p2). 
Aldous 



[20] showed that a result like Theorem |II.4| can 
be expected when the underlying Markov process converges 
rapidly to stationarity. Indeed, the Markov process (X t )t>o 
for the case K = 1 turns out to have a small mixing time. 
However, the picture changes drastically in the case K > 2, 
for which we prove in Section [V] that the mixing time is 
0(i^ _1 ) with /? the size of the second-largest component; 
hence the mixing time is of the same order as the escape time 
of the second-longest branch, see ([3]). For large v, this implies 
that the Markov process is extremely slowly mixing, which is 
another way of understanding why the regenerative approach 
sketched above fails in this situation. 

III. Bipartite interference graph 

In this section we examine the asymptotic behavior of the 
transition time T(h x ,1^ ,(k 2 ,i 2 ) ( v ) as v 00 f° r anv P a i f 
of states (kx,h) and (fc 2 ,Z 2 ) when the interference graph 
is bipartite. We will establish how the expectation scales 



(Theorem III. 1 1 and use two different methods to prove that 



the scaled transition time has an asymptotically exponential 
distribution when states (kx, h) and (k 2 , ^2) belong to different 
branches (Theorem |III.2| i. Most of these results in fact extend 
to arbitrary partite graphs, as will be shown in the next section. 
However, we treat the case of a bipartite graph separately first, 
since it allows us to develop the key ideas in a relatively 
transparent setting, while being sufficiently rich to exhibit the 
essential qualitative characteristics of general scenarios. 

Consider a bipartite interference graph, i.e. K = 2, where 
the two components C\ and C 2 have sizes L x and L 2 , respec- 
tively. In this case, the two branches of the state space along 
with the root node form a simple linear array. For notational 
convenience, we will therefore relabel the states, indicating 
the number of active nodes in C\ by a negative integer in 
{— Lx, . . . , — 1, } and representing the number of those in C 2 
by a positive integer in {1, . . . , L 2 }. 

The activity process (X t )t>o on il = C\ U {0} UC 2 evolves 
as a birth-and-death process with transition rates 

l\ if - la < I <0, 

(L 2 -l)v ifO<Z<£ 2 , 

(Li - \l\)v if - Li <l <0, 




if < I < L 2 , 



yielding 



■n-i{v) = ttoO) yf^j v \ l = 



7i"mH = tt q (v) ( ^ ) v m , m = l,...,L 2 . 



(5) 
(6) 



A. Asymptotic growth rate 

We first determine how the expectation of the transition time 
T-i lt i 2 {y) scales as v — > 00. 



Theorem III.l. For any < l\ < L\ and < Z 2 < L 2 , 



ET_ 



(7) 



L1L2 

In the CSMA context, the above theorem implies that 
when at least one of the nodes in component k is currently 
active, k = 1,2, it will approximately take an amount of 
before any number of nodes in the other 



time 

component 3 — k will have a chance to transmit. 

Proof: As before, note that for any l\ < Z 2 ET; li / 2 (i^) = 
X^/IT] 1 ETi^ + i so we can again use a general result for 
birth-and-death processes JT8), which in the present case says 

that m,l + i{v) = ^rj+T) EU- £l t& for ^ 1 < L *- 
Now <j5j and |6| imply that iTniy) = o(7T_i 1 (i')) as v — > 00 
for all n = —L\ + ^...jininlL! — 1,L 2 } and Tr n (v) = 
o(iii(v)) as v — > 00 for all n = —Li, . . . , I — 1, when / = 
L\ + 1, . . . , L<x- Thus, for — L\ < I < 0, as v — > 00, 



ET M+ i(i/) 



l TTi{v) Li! 

and, for < I < L 2 , as v — > 00, ETu + i(v) scales as 

if I < Li, 



^-1 *_ Ll (v) _ (L 2 -l-l)\l\ Li-l-1 
(L 2 -l) L 2 \ U 



■n--L 1 (v)+-KL 1 (i>) _ L 2 \+L 1 '.(L 2 -L 1 y. , 



(L2-L1) K Ll (v) 

I/" 1 TTi(u) _ 1 



if I = U 



(1,2-2) 'TlM 



L 2 -r 



{L 2 -Lx)L 2 \ " 

if I > Li. 
(8) 

It follows that ETi t i + x(v) = o(ET_i )0 (^)) as v -> 00 for all 
I = —Lx, . . . , —2 and I = 1, . . . , L 2 — 1. Hence, as ^ — > 00, 



ET_, IiI> (i/)~ET_i i o(i/)+ET ,i(i/) 
which yields |7]). 



£1 



L 2 



Remark 1. Theorem III.l only deals with the case where 



the two states —lx and l 2 belong to different components, 
but extends to the case where the two states belong to the 
same component. In case < l 2 < h < Lx, the transition 
time T_; 1 ._/ 2 (^) is not influenced by the presence of the 
second component, and its expected value thus follows from 



Proposition II. 1 In case < l\ < l 2 < £ 2 , it may be deduced 
from |8j that ET iu i 2 (v) — ETi lt i 1+1 (v) as v -> 00, yielding 



(L 2 -L!)L 2 \ 



-h 



if lx = Lx , u 
if h > Lx 



B. Asymptotic exponentiality 



In the previous subsection we obtained the asymptotic 
growth rate of the mean transition time ET-i lt i 2 (v) as v — > 00. 
We now turn attention to the scaled transition time and will 
prove that it has an asymptotically exponential distribution as 
stated in the next theorem. 



Theorem III.2. For any < l\ < Li and < l 2 < L 2 

Before providing a proof, we first present an interpretation 
of the above theorem. In order for the process to make a 
transition from state —l\ to state I2, it must first reach state 0. 
Proposition |II. ^ indicates that the expected transition time from 
state — L\ to state — 1\ is asymptotically negligible compared 
to the expected transition time from state — 1\ to state 0. 
This means that the transition time from state —l\ to state 
asymptotically behaves as the transition time from state —L\ 
to state 0. Theorem |II.4| shows that the latter time, after 
scaling, has an asymptotically exponential distribution. Once 
the process has reached state for the first time, there are two 
possible scenarios. With probability p = L\j{L\ + L 2 ), the 
process returns to state —1, and then almost surely falls back 
to state —L\ quite rapidly (compared with the time to reach 
state 0). With probability 1 — p, the process moves to state 1, 
and then most likely is attracted to state l 2 quite quickly. 

In conclusion, in order for the process to make a transition 
from state —l\ to state l 2 it must visit state once plus a 
geometrically distributed number of times with parameter p. 
The successive time periods to reach state are independent 
and, after scaling, exponentially distributed. Observing that 
the sum of one plus a geometrically distributed number of 
independent and exponentially distributed random variables is 
again exponentially distributed, we arrive at the statement of 
the above theorem. 

Obviously, the above argumentation is heuristic, and in the 
next two subsections we will provide a rigorous proof. The 
first proof method is analytical in nature, and relies on an 
asymptotic characterization of the eigenvalues of the generator 
matrix, while the second proof is more probabilistic, and in 
fact closely mirrors the above intuitive explanation. 



Remark 2. Theorem III. 2 only deals with the case where 
the two states —1% and l 2 belong to different components, 
but extends to the case where the two states belong to the 
same component. In case < l 2 < h < Li, the transition 
time T-i ± i 3 (y) is not affected by the presence of the 
second component, and thus has an asymptotically exponential 
distribution according to Theorem II.4 In contrast, in case 



< h < l 2 < L 2 , a fundamentally different situation emerges. 
In that case, a series of upward transitions from state l\ to 
state l 2 occurs in rapid succession with high probability, so 
that the transition time converges in distribution to 0, even 
though its expectation may not tend to and in fact grow to 
infinity as v — > 00 when l\ < L%, as observed in Remark [T] 

C. Analytical approach 

We now extend the analytical approach of Section [II] to 
prove the asymptotic exponentiality of the scaled transition 
time as stated in Theorem |III.2| For convenience, we restrict 
attention to l\ = L\, but the proof readily extends to any 



Let Q(f ) be the generator matrix of the birth-and-death pro- 
cess (X t )t>o on the state space {— L\, . . . , — 1, 0, 1, . . . , I2} 
with l 2 an absorbing state. Let < 71(f) < 72(f) < ■ < 
7L 1 +i 2 (f) d enote the non-zero eigenvalues of — Q{v) 

Theorem 
where Yi(f), 



II.2 



implies that T_ Xll , a (i/) = £fi+ ia Y^u), 
Yl 1+ i 2 (v) are independent and exponen- 
tially distributed random variables with ¥Yi(v) — 1 /71(f). 

The following lemma shows that the smallest eigenvalue 
71(f) becomes dominant as v —> 00, and implies the asymp- 



totic exponentiality stated in Theorem III. 2 The proof is 



similar to that of Lemma III. 31 and thus omitted 



Lemma III.3. 

Una 7i(f) • ET_ Ll! ; 2 



1, 

00, 



D. Probabilistic approach 

We now present an alternative, probabilistic approach to 
establish the asymptotic exponentiality of the scaled transition 



time as stated in Theorem III. 2 In contrast to the analytical 
method as used in the previous subsection, the probabilistic 
approach in fact extends to arbitrary partite graphs, but we 
first focus on the case of a bipartite graph in order to 
illuminate the key ideas. The approach relies on a stochastic 
decomposition of the transition time into independent random 
variables which are easier to handle. In order to obtain the 
stochastic decomposition, we consider the evolution of the 
process as it makes a transition from state — 1\ to a state l 2 , 
and define the following random variables: 

• T-i u -x(v): time to reach state —1 for the first time; 

• ^-xoW)'' ti me t0 reacn state once the process has 
reached state —1 for the first time; 

• N: number of times the process makes a transition — > 
— 1 before the first transition 0^1 occurs; 

• ^o -iM' tmie s P ent m sta te before the i-th transition 
back to state — 1, i = 1, . . . , N; 

• Tb,i(f ) : time spent in state before the first transition to 
state 1: 

• ^-1 o( u )' ti me to retum to state after the i-th transition 
back to state — 1, i = 1, . . . , N; 

• Ti : i 2 (v): time to reach state l 2 once the process has 
reached state 1 for the first time. 

By definition, the transition time may be represented as 



h,l: 



(1,-1 

N 

E 

i=l 



T 



(0) 



(9) 



T, 



T 



(i) 

1,0 



• T, 



0,1 



where the dependence on the parameter v is suppressed for 
compactness. We can make the following observations: 
• The random variables T-i lt -i and Tij 2 are distributed as 
typical hitting times for the respective pairs of states; 

0,1, ... ,N, are i.i.d. copies 



• The random variables T 
of a typical transition time T_i.q; 



0) 

1,0' 



The random variables T 1 and T, 



(i) 



i = 1,2, 



are i.i.d. copies of a random variable Tq = Exp((Li + L 2 )v), 



which represents the residence time in state 0; 
• N = Geo(p), v = , l }t , independent of the parameter v\ 



L x +L 2 



• All the random variables are independent. 

Based on the stochastic representation in |9) and the above 
observations, we now proceed to give a proof of the asymptotic 



exponentiality of the transition time as stated in Theorem III. 2 



The proof consists of three main parts: (i) the first part 
establishes that the distribution of the scaled transition time 
T-i 1 .i 2 {i') asymptotically coincides with that of a dominant 
term U{y), which involves a random sum of i.i.d. random 



variables (Proposition III.4i; (ii) the second part serves to 



identify the asymptotic behavior of each of these random 
variables (Proposition |III.5[ ); (iii) the third part then shows that 
a scaled random sum asymptotically behaves as the random 



sum of the scaled terms (Proposition III. 6 1. 

Part (i): According to Theorem |III. 1 1 and Remark [T] the 



mean values of the random variables T 



1,0' 



0,1, 



,N, 



asymptotically dominate, i.e. they are an order-of-magnitude 
larger than those of all other random variables in |9} as 
v — » oo. This suggests that the asymptotic behavior of the tran- 
sition time T-i^{v) will be determined by that of U(v) = 
Sfco T—i 0(^)1 an d m particular that the distribution of the 
scaled transition time T_/ I) j 3 (i/)/ET_j 1) / 2 (i') asymptotically 
coincides, as v — > 00, with that of U (y) JEU {v) . This follows 
as a special case of Proposition |III.4| below, whose proof is 
technical, but relatively straightforward, and omitted because 
of page limitations. In passing we note that the above obser- 
vations also imply that the expectation of TL^ i 2 {v) scales 
as EU{u) = (1 +EAT(i/))ET_i, (f) = kk j ±i ET_ lfi (v), 



with ET_ 10 (i>) 



00 according to 



Proposition II. 1 which corroborates Theorem III. 1 



Part (ii): In order to determine the asymptotic behav- 
ior of the random variable T_ 10 (^), we first observe that 

r_ £l , (y)=r- £l ,-i(i/) + T_i,o(f). According to Theo- 
rem 



III.l 



the mean value of T_£, 



-i(v) is asymptoti- 
cally negligible compared to that of T-^ofV), which sug- 
gests that the asymptotic behavior of T!_i q(i/) is equiva- 

states that 



II.4 



lent to that of T_i, u q{v), Since Theorem 
I 1 -L 1 ,o(i / )/ET_i li o(i/) has an asymptotically exponential dis- 
tribution, this would imply that the same holds for the scaled 
hitting time T_i.o(v)/ET_i i o(c), as is covered as a special 
case of Proposition |III.5| stated and proved below. 
Part (iii): It remains to be shown that U(v)/EU{v) 



with 



— J2iLo ^-1 o(^)' asymptotically behaves as the ran- 
dom variable 1 _ i ^ EN 

J2Zo Y *> when r_ lf0 (i/)/ET_ lj0 (i/) A 
Y, and Y\, Y2, . . . are i.i.d. copies of the random variable Y. 



-.(0 



This follows as a special case of Proposition III. 6 below, whose 
proof is in Appendix B. In particular, if N is geometrically dis- 
tributed and Y is exponentially distributed, then U (v) /EU{v) 
has an asymptotically exponential distribution as well. 

Proposition III.4. Let T(v),U(v), V(u),W(u) be non- 
negative random variables such that 

(i) lim^ 00 EF(^)/E[/(z/) = = lim^oo EW{v)/EU(v); 

(ii) For every v > 0, U - V < st T < st U + W, i.e. Vi > 



¥{U -V >t}< ¥{T > t} < ¥{U + W > t}; 

(iii) U{v)/mj(v) A X as v — > 00, where X is a random 

variable independent of v with continuous c.d.f.. 

Then T(v)/ET(v) -4 X as v -> 00. 

Proposition III.5. For any < I < L, 

->Exp(l), v -> 00. 



Proof: The birth-and-death structure of the process and 
the strong Markov property yield the stochastic identity 
TLfl(y) = Tl.i{v)+TiA v )i which gives the stochastic bounds 
TlA") ~ Tl,i{v) < st T Ji0 (i/) < st T L , (v) (the two terms 
in the low er bound being dependent). It follows from The- 



orem 



II.4 



In 



that T L ,oO)/ET Li o(» A Exp(l) as v -> 00 
order to complete the proof, we can then use Proposition III.4 
taking U(v) = T Lfi {y), v{v) = T L>l (v) and W(v) = 0. The 

0, 



condition which needs to be checked is lim„ 



which follows directly from Proposition II. 1 



EV(t/) 
W\v) 



Proposition III.6. For every v > 0, define Sm{ v ) '■= 
^2i—lXi(i/), where M is an integer-valued random variable 
and {Xi{vy\iyi is a sequence, independent of M, of i.i.d. 
copies of a random variable X(i>), with EX{y) < 00. Assume 
that X{y)/EX(y) A- Y as v — > 00, where Y is some unit- 
mean random variable. Then 



Sm(v) 



1 



M 



»=i 



00, 



where {l^}i>i is a sequence, independent of M , of i.i.d. copies 
of the random variable Y. 

IV. Arbitrary partite graphs 
In this section we investigate the asymptotic behavior of 

the transition time T(i Slt i 1 \jk 2 ,l2)( l/ ) as v ~^ 00 f° r an Y P a t r 
of states (fci, l{) and (fe, I2) in arbitrary partite graphs. While 
the key ideas are similar to those used for the bipartite graph 
in the previous section, there are now some dependencies 
that require specific treatment and some different qualitative 
features that arise in certain scenarios. In particular, it turns 
out that the scaled transition time may no longer have an 
asymptotically exponential distribution. This scenario arises 
when there are longer branches than k\, in which case the 
number of returns to the root node that dominate the transition 
time is no longer geometrically distributed plus one, but just 
geometrically distributed. 

As in the case of the bipartite graph, the proof approach 
involves a stochastic representation of the transition time, but 
some of the terms are now no longer entirely independent. In 
order to obtain the stochastic representation, we consider the 
evolution of the process as it makes a transition from a state 
(ki,li) to a state (^2^2)1 ar, d define the following random 
variables: 

• ^(*i,Ji),(fci,x)( 1/ ) : tmie to reacn state (fci, 1) for the first time; 

• T$ ^ (i / ): time to reach state after state (fci, 1) is visited 
for the first time; 



• N k : number of times the process makes a transition — s- 
(k, 1), k 7^ k2, before the first transition — > (&2, 1) occurs; 

• Tq / fe time spent in state before the i-th transition 
back to state (k, 1), k ^ k 2 , i = 1, • • • , A^; 

• ^o.(fe 2 ,i) t i me spent m state before the first transition 
to state (k 2 , 1); 

• T/£ ^ q(^): time to return to state after the i-th transition 



back to state (fc, 1), fc =^ fc 2 , z = 1, 



,N k ; 



T, 



(k 2 ,i),(k 2 ,l 2 ) 



[v): time to reach state (fe, £2) once the process 



has reached state (&2, 1) for the first time. 

By definition, the transition time may be represented as 



T (kuh),(k2M- T (k u h),(kl,l) + T (k u i),o 



(10) 



+ X! X! (^V(M) + T (M),o) +^0,(AS2,1) + T (fc 2 ,l),(fc2,i 2 ): 
k^k 2 *=1 

where the dependence on the parameter v is suppressed for 
compactness. Define L := Ylk=i ^k, 7 := Lv, and p k := 
L fc /L for all fc = 1, . . . , K. 

We can make the following observations: 

• The random variables T(k u h),(.k u i) and T(k 2 ,i),(k 2 ,l 2 ) are 
distributed as typical transition times for the respective pairs 
of states; 

(i) 

• The random variables TK'^ Q are i.i.d. copies of a typical 
transition time Tiif i),o, i = 0, . . . , Nk, k ^ k 2 , 

• The random variables Tb,(fe 2) i) and Tq / k 1 y k ^ ki, i = 

1, . . . , iVjfe, are i.i.d. copies of a random variable To = Exp(7), 
which is the residence time in state 0; 

• N := Y^kjtki = Gco(l — f>fc 2 ), independent of the 
parameter 1/; 

• Given N = n, N = (N 1 ,...,N k2 - 1 ,N k2+ u...,N K ) 
has a multinomial distribution with parameters n and pi, ... , 

pk 2 -i,Pk 2 +i, ■ ■ ■ ,Pk, with p k =p k /(l-p k2 ) 

P{N = (m, . . . ,nfc 2 _i,n fca+ i, . . .,n K )} 



\n-L,...,n K 



J2 k 



n k 



fc#fc 2 



Pk 2 



J2k^k 2 n k 

m,...,n K 



(11) 



• All the random variables representing time durations are 
mutually independent, as well as independent of the random 
variables N k , k 7^ k 2 . 

Define := max j^ k2 Lj, := {k 7^ k 2 : L k = £*}, 
and p., := \K*\L*/(\K*\L* + L k2 ). 

We first use the stochastic representation ( p~0] > to establish 
how the expectation of the transition time scales. 

Theorem IV.l. For < h < L kl , < h < L k2 , ki ^ k 2 , 



ET (k u h),(k 2 M)(") 



— h - — )v * , v -> 00. 

L k2 J 

In the CSMA context, the above theorem implies that for 
any component k when some other component is presently 
active, it will roughly take an amount of time of the order v L , 
L* = maxj^fe Lj before any number of nodes in component k 



will get an opportunity to transmit. The proof of the above 
theorem is subsumed in that of the next one. 

We now proceed to determine the asymptotic distribution 
of the scaled transition time. 

Theorem IV.2. For i)<h< L kl , < l 2 < L k . 2 , ki ^ k 2 , 



T, 



{ki,h);(k 2 ,l 2 ) 



ET (fc 1 ,Zi),(fe 2: i 2 )( Jy ) 



I \ 1 M 



EM 



where Af = Geo(j>*) + I{ kl eK,} an d Y% are independent 
exponentially distributed random variables with unit mean. 

Proof: The proof is similar to that for the bipartite 
graph and is based on the stochastic representation ( fT0| ) of 



the transition time Ti 



that ETi 



(ki,h).(k 2 ,l 2 ) 



(fei,Zi),(fei,l) 



(u). Proposition III. 1 1 implies 

(is) ~ v L *i- 2 and ET( kjl)>0 (i7fZ v^' 1 

for every branch k. Moreover, ET , ,(fc,i)(' y ) = °(1)> and it is 
easily verified that ETf k2 ^-\f k2 ^ 2 \{v) = o(^ i *~ 1 ) similarly 
as in the proof of Theorem HI. 1 1 Thus the asymptotically 
dominant term in ( fT0| ) is 



^.lj.oM^eK,} 



keK, t=l t=l 

for M = Geo(p«) + I{ kl eK*} and for any k g K*, where 
the latter stochastic equality follows from ( fTTj ) and the fact 
that the random variables Tt k< i\ are identically distributed 
for all k € K*. Taking X^y) ■= T ( k i),o(v) and applying 



M 



Propositions III.4|III.6" then completes the proof. Also, noting 



that EM = I {kl eK t }+P*/(l-P*) andET (M) , = v L ~ l /L* 
yields the statement of Theorem |I V. 1 1 ■ 
The fc-th branch, k 7^ k 2 , is called weakly dominant if k £ 
K*, i.e. if L k > Lj for all j ^ k 2 . Based on Theorem IV.2 we 



may distinguish two scenarios, depending on whether branch 
k\ is weakly dominant or not. 

Suppose that branch k\ is weakly dominant, i. e. L k . = L*. 
In this case M = Geo(p*) + 1 and so Theorem IV.2 implies 
that for every < h < L kl and < l 2 < L k2 

T (k u h),(k 2 J 2 )(v) d„ n , 

— ^ ^ '—— -^Exp(l), v^co. 

ET (fci,/i),(fe 2 ^ 2 )(^) 

Thus in this case the scaled transition time converges to an 

exponential random variable with unit mean as v — > 00. 

Suppose instead that branch k% is not weakly dominan t, i.e 

Lfcj < L*. In this case M = Geo(p*) and so Theorem 

implies that for every < l\ < L kl and < l 2 < L k2 



IV.2 



r (fci^i),(fc 2 J 2 )(^) 1 - 

ET (fci,ii),(fc 2 ,Z 2 )M P- 



P- 



Geo(p») 

E 

i=l 



where Yi are independent and exponentially distributed ran- 
dom variables with unit mean. 

V. Mixing times 

In the previous sections we have analyzed the transient 
behavior of our Markov process (X t )t>o in terms of hitting 
times. In this section we turn attention to the long-run behavior 



of our Markov process and in particular examine the rate of 
convergence to the stationary distribution. We measure the rate 
of convergence in terms of the total variation distance and 
explore the intimate connection between the hitting times and 
the so-called mixing time of our Markov process. The mixing 
time describes the time required for the distance to stationarity 
to become small. It turns out that the mixing time is largely 
determined by the time it takes the process to escape from 
the second-longest branch in the state space, as formalized 
in Theorem | V. 1 1 below. In this section, we assume without 
loss of generality that the branches are indexed such that 
L\ > L% > ■ ■ ■ > Lk, and denote by dk the leaf node of the 
fc-th branch. Also, we attach the starting state X(0) = x E ft 
as a superscript to our notation for the Markov process, and 
thus write (Xf )t>o- 

A. Main result 

Our objective is to bound the maximal distance over x € 0, 
measured in terms of total variation, between the distribution 
at time t and the stationary distribution: 

d(t) :=max\\F{X? e ^-ttHtv- 

We define the mixing time of our process as 

t miK (e,v)=in£{t>0:d(t)<e}, 

Theorem V.l. The mixing time of the Markov process (X t )t>o 
satisfies 

i.e. 3Ci(e),C2(e) > 3 vq > such that for all v > vq 
Cx(e) v h -~ x < t mix (e,i/) < C 2 {e)v L >-\ 



Theorem V.l shows that it can take an extremely long 
time for the process (X t )t>o to reach stationarity, especially 
when v is large. Such a long mixing time is typically due to 
the process being stuck for a considerably period in one of 
the components, and thus not visiting the states in the other 
components. This is particularly relevant when in the network 
has two or more dominant components which together attract 
the entire probability mass in the limit as v — s- oo. Indeed, in 
this case the mixing time provides an indication how long it 
takes for a certain fairness among the dominant components to 



occur. We will prove Theorem V. 1 by deriving an upper bound 
for d(i) using coupling in Subsection V-B and a matching 
lower bound for d(t) using the bottleneck ratio and the notion 
of conductance in Subsection IV-CI 

B. Upper bound using coupling 
It can be easily established that 

d(t) < d(t) := max ||P{X? € •} - ¥{Xf G •} || TV . 

We consider d(t) instead of d(t), because it can be bounded 
using a standard coupling technique. Consider all couplings 
of the processes (Xf,Xf) with the property that both (Xf) 
and (Xf) are Markov processes that have the same generator. 
Moreover, assume that the coupling is such that the two 



processes stay together at all times after they have met for 
the first time. Under these assumptions, we have that 

||p{*?€-}-P{x t »e.}||Tv 

= max|P{X* e A}-F{Xfe A}\ 

= maxP{JT? € A, X+ 4 A} 
Acn 

<p{xf ^xn = p{<4 le >t}, 

where the coupling time T^ o ^ pl0 = min{i > : Xf — Xf} 
denotes the first time the two processes (Xf) and (Xf) meet. 



Therefore, d(t) < d(t) < max XyVen P |r c x o ^ plc > tj , and the 
strength of this coupling inequality depends of course heavily 
on the choice of the coupling. 

We now introduce a birth-and-death process (M t ) t >o that 
will play a crucial role in our coupling. Let (M t ) t >o describe 
the position of a particle that lives only on the two longest 
branches of the state space C\ U {0} U starts in the leaf 
node c>2, and moves within the two branches according to the 
same transition rates as (X t ) t >Q. Call the particle that moves 
according to (M t ) t >o particle 0. Consider also a particle 1 
and particle 2 whose positions are governed by the coupled 
Markov process (Xf,Xf). To be more specific, we denote the 
exact position of particle i at time t by (K\ , L\), i = 0,1,2, 
with K I the branch and L\ the level. 

Proposition V.2. For all x,y € £1, the coupling time T c ^ ple , 
with x — (k\,l\) and y — (k2,h) is stochastically bounded 
from above by the absorption time Tq 2 ^ 1 of state d\ of 
the continuous-time birth-and-death process (M t )t>o starting 
from state Mq = 82- 

Proof: The coupling is such that particles 1 and 2 stay 
together at all times after they have met for the first time. 
Before that time, whenever a particle i (i = 1,2) resides 
in Ci U C2, it is coupled to particle in the following way: 
whenever particle moves towards the root, particle i moves 
towards the root. In order to construct this coupling, we 
introduce a Poisson clock with rate L\ . When the clock ticks, 
we first generate a [0, 1] uniform random variable U and then 
do the following: move particle i down one level (towards the 
root) if U < L\jL\\ also move particle down one level 
if U < L\jL\. This coupling ensures that when L\ = L\ 
(i = 1,2) and particle moves down one level, then so 
does particle i. A consequence of the above coupling is that 
whenever particle enters C\ both particle 1 and particle 2 
reside in C\. Extend the coupling by assuming that when 
particle meets a particle i (i = 1,2) in some state in C\, 
the two particles keep making the same transitions as long as 
they are in C\. Hence, inevitably, by the time particle reaches 
d\, the particles 1 and 2 are coupled. ■ 
Since, for every v > and for all x,y € il, / rf u P i (^) <st 
Tg 2 _Q 1 (v) : we arrive at the following result: 



d(t)<d{t)<¥{T d2tdl (u)>t}< 



E7 d 2 ,9iO) 



(12) 



It then follows immediately that t m i X (e, v) < e 1 WTq 2 ^q 1 (v). 



Using Theorem IV. 1 we thus obtain the following upper 
bounds on the distance to stationarity and the mixing time. 

Proposition V.3. For the Markov process (X t )t>a the maxi- 
mal total variation distance is bounded by 



t t \ L 1 L 2 

and the mixing time is bounded by 

imixM) < ^^M- 1 ( Ll T + T L2 ) V L *- l ,V^OC. 

e e \ L\Li J 

C. Lower bound exploiting the bottleneck 

Consider the activity process (X t ) t >rj with activation rate 
v. For S C il, let tt(S) := Yl(k l)eS w (k,l)( l/ ) ^> e me 
stationary probability of S. Define the flow rate out of S as 

Q(S, S c ) := Y,(k,l)eS,{j,m)eS^ik,l)( v )Q{(kJ),(j,rn)) and 
the conductance of S as $(#) := Q(S, S°)/n(S). 
The conductance of the process (X t )t>o is defined as 



min <1> C'S') - 

S:tt(S)<1/2 



All the quantities we just defined depend on v, but we 
suppress it for conciseness. 

The following result, valid for any Markov process on a 
finite state space with conductance shows how the 
conductance of the process yields a lower bound on the mixing 
time. It is a continuous-time version of Theorem 7.3 in [ |27| 
and since the proof is quite similar, it is omitted. 

Lemma V.4. For e e (0, \), t mix (e) > (| - 2e) 

We now exploit the fact that our activity process on the 
state space il has a geometric feature usually referred to 
as bottleneck, that strongly influences the mixing time. This 
bottleneck indeed makes some parts of ft difficult to reach, 
resulting in a small conductance. As it turns out, C 2 will be 
the bottleneck. 

Proposition V.5. The conductance of C2 satisfies 

$(C 2 ) ~ L 2 v 1 -^ 
and hence, for e 6 (0, |), 



v — \ 00, 



Ln-l 



OO. 



(13) 



Proof: Since C\ U {0} will have at least half of the 
probability mass for v sufficiently large, it is clear that 
""(C2) < 1/2 when v — s- 00. From |2]) it follows that 
if I and m belong to the same component k of size L, 



then 



7r (fc,m)(' / ) 



7r (fc,!)( ,y ) 

conductance of C 2 satisfies 



-I \l- — s ' vm a ^ '' 



*(C 2 ) 



7i"(2,l)M ' 1 



^(2,1) (^) 
7r (2,I. 2 )(^) 



00. Thus the 



Then Lemma |V.4| gives the lower bound ( fT3j ), since by 
definition < $(C 2 ). ■ 



VI. Conclusion and extensions 

We have examined transient throughput characteristics and 
associated starvation effects in CSMA networks in terms of 
the transition times between dominant activity states. We 
established how the magnitude of the transition time scales 
with the activation rate and the sizes of the various network 
components in partite interference graphs. We also proved that 
in several cases the scaled transition time has an asymptotically 
exponential distribution and discussed the connection with 
related exponentiality results for rare events and meta-stability 
effects in statistical physics. In addition, we investigated the 
convergence rate to equilibrium of the activity process in terms 
of mixing times. 

In the present paper we have focused on partite interference 
graphs with uniform activation rates, giving rise to a star- 
shaped state space, but most of the methods and results 
extend to more general networks, as long as there is a 
unique path between any two activity states. For example, 
we could allow for the branches to be general trees, with 
Mfc i nodes in the fc-th tree at distance I from the root, and 
transition rates b^ if^ ^v) and j away from and towards 
the root, respectively. The node-dependent functions fk,i{v) 
are particularly relevant, since taking fk,i{v) = v 1 l Lk could 
for example serve to balance the long-term throughputs in 
the various components. In those cases, the expectation of 
the transition time will scale differently, but most of the 
distributional results for the scaled transition time continue to 
hold, even though qualitatively different scenarios arise, with 
some branches being visited for relatively short periods but an 
overwhelmingly large number of times. 

References 

[1] F. P. Kelly, Reversibility and Stochastic Networks. Wiley, 1979. 

[2] R. Boorstyn, A. Kershenbaum, B. Maglaris, and V. Sahin, "Through- 
put analysis in multihop CSMA packet radio networks," IEEE Trans. 
Commun., vol. 35, no. 3, pp. 267-274, 1987. 

[3] F. R Kelly, "Stochastic models of computer communication systems," J. 
Roy. Statist. Soc. Ser. B, vol. 47 (3), pp. 379-395, 1985. 

[4] A. Kershenbaum, R. Boorstyn, and M. Chen, "An algorithm for eval- 
uation of throughput in multihop packet radio networks with complex 
topologies," IEEE J. Sel. Areas Commun., vol. 5, no. 6, pp. 1003-1012, 
1987. 

[5] X. Wang and K. Kar, "Throughput modelling and fairness issues in 
CSMA/CA based ad-hoc networks," in Proc. IEEE Infocom, 2005, pp. 
23-34. 

[6] M. Durvy and R Thiran, "A packing approach to compare slotted and 
non-slotted medium access control," in Proc. IEEE Infocom, 2006, pp. 
1-12. 

[7] J. Shi, O. Gurewitz, V. Mancuso, J. Camp, and E. W. Knightly, 
"Measurement and modeling of the origins of starvation in congestion 
controlled mesh networks," in Proc. IEEE Infocom, 2008, pp. 1633- 
1641. 

[8] G. Bianchi, "Performance analysis of the IEEE 802.11 distributed 
coordination function," IEEE J. Sel. Areas Commun., vol. 18, no. 3, 
pp. 535-547, 2000. 

[9] S. Liew, C. Kai, H. Leung, and P. Wong, "Back-of-the-envelope com- 
putation of throughput distributions in CSMA wireless networks," IEEE 
Trans. Mob. Comput., vol. 9, no. 9, pp. 1319-1331, 2010. 
[10] Y. Suhov and U. A. Rozikov, "A hard-core model on a Cayley tree: 
an example of a loss network," Queueing Systems, vol. 46, no. 1, pp. 
197-212, 2004. 

[11] S. Zachary and I. Ziedins, "Loss networks and Markov random fields," 
/ Appl. Prob., vol. 36, no. 2, pp. 403-414, 1999. 



[12] R. Gouet and A. Sudbury, "Blocking and dimer processes on the Cayley 
tree," /. Stat. Phys., vol. 130, no. 5, pp. 935-955, 2008. 

[13] O. Haggstrom, "Ergodicity of the hard-core model on 7? with parity- 
dependent activities," Ark. Mat., vol. 35, no. 1, pp. 171-184, 1997. 

[14] P. M. van de Ven, J. S. H. van Leeuwaarden, D. Denteneer, and A. J. 
E. M. Janssen, "Spatial fairness in linear random-access networks," 
Performance Evaluation, vol. 69, no. 3-4, pp. 121-134, 2012. 

[15] F. den Hollander, "Metastability under stochastic dynamics," Stock. 
Proc. Appl, vol. 114, no. 1, pp. 1-26, 2004. 

[16] E. Olivieri and E. Scoppola, "Markov chains with exponentially small 
transition probabilities: first exit problem from a general domain. I. The 
reversible case," /. Stat. Phys., vol. 79, no. 3-4, pp. 613-647, 1995. 

[17] S. Karlin and J. McGregor, "Coincidence properties of birth and death 
processes," Pacific J. Math., vol. 9, pp. 1109-1140, 1959. 

[18] J. Keilson, "A review of transient behavior in regular diffusion and birth- 
death processes. Part II," J. Appl. Prob., vol. 2, no. 2, pp. 405^428, 1965. 

[19] , "Log-concavity and log-convexity in passage time densities of 

diffusion and birth-death processes," J. Appl. Prob., vol. 8, no. 2, pp. 
391-398, 1971. 

[20] D. J. Aldous, "Markov chains with almost exponential hitting times," 
Stock. Proc. Appl., vol. 13, no. 3, pp. 305-310, 1982. 

[21] D. J. Aldous and M. Brown, "Inequalities for rare events in time- 
reversible Markov chains I," in Stochastic Inequalities, ser. IMS Lecture 
Notes Monogr. Ser., 1992, vol. 22, pp. 1-16. 

[22] , "Inequalities for rare events in time-reversible Markov chains II," 

Stock. Proc. Appl, vol. 44, no. 1, pp. 15-25, 1993. 

[23] R. Griibel and M. Reich, "Rarity and exponentiality: an extension of 
Keilson's theorem, with applications," J. Appl. Prob., vol. 42, no. 2, pp. 
393^406, 2005. 

[24] J. Keilson, "A limit theorem for passage times in ergodic regenerative 

processes," Ann. Math. Statist., vol. 37, no. 4, pp. 866-870, 1966. 

[25] , Markov Chain Models-Rarity and Exponentiality. Springer, 1979. 

[26] W. Ledermann and G. Reuter, "Spectral theory for the differential 

equations of simple birth and death processes," Philos. Trans. Roy. Soc. 

London. Ser. A., vol. 246, pp. 321-369, 1954. 
[27] D. Levin, Y. Peres, and E. Wilmer, Markov Chains and Mixing Times. 

AMS, 2009. 

Appendix 

A. Proof of Lemma \H.3\ 

Order the state space as £1 = {L,L — 1,...,1,0} and 
consider the generator matrix Q(v) of the process (Xt)t>o 
with an absorbing state. That is, 

'qhiy) L 
v qh-\{v) L-l 



Q{v) = 



(i-l)i/ 




?i 0) 




where the diagonal elements are qi{v) = 
I = 1, . . .,£. Write Q{v) as 

T» t(i/) 




-(£ - l)v - I for 



QO) 



where T(V) is an L x L invertible matrix. Since the char- 
acteristic polynomials of — Q(v) and — T(v) satisfy the re- 
lation p_Q(„)(,z) = — zp-T(i^) { z )i the spectrum of — Q(v) 
consists of that of — T(V) plus the eigenvalue zero with 
multiplicity one. Denote by D(v) the L x L diagonal matrix, 
whose diagonal entries are {y ' Oi{v)}\_ L , where the f9's are 
the so-called potential coefficients, defined as Ol(v) = 1 
and 6i-i{v) = , L ^ V 9i{v). The L x L matrix G(v) = 
— D(v) 1 / 2 T(V) D(v)~ x l 2 is tridiagonal and symmetric with 
diagonal entries gi,i{v) = q L _ l+1 (i/) and gi,i+i{v) = 



9i+i,ii v ) = —y/l- (£ — / + V)v. Since G(y) is similar to 
— T(z/), they have the same spectrum. Denote by T>(p, R) 
the closed disc centered in p with radius R, i.e. T>(p, R) = 
{z £ C : \z — p\ < R}. Consider the Gershgorin discs 
{Vi{v)}f=i of G(y), defined as V x {v) := 2?(-®(i/),iZ,(i/)), 
where the radius Ri(v) is the sum of the absolute values of 
the non-diagonal entries in the L — 1 + 1-th row, i.e. Ri(v) := 
E m /i-! + i |ffi-l+i,mM|. Then 

V L {v)=V{L,Vu>), 
V L -i{y) =V(L-l + u, VIv + y/2(L-l)u), 



V 2 {y) = X>(2 + (£ - 2)i/, y/Z(L-2)v + y/2{L-l)v), 

X>i(i/) = P(l + (£- l)v, y/2[L-l)v). 

We now exploit the second Gershgorin circle theorem, 
which is reproduced here for completeness. 

Theorem. If the union of j Gershgorin discs of a real 
r x r matrix A is disjoint from the union of the other r — j 
Gershgorin discs, then the former union contains exactly j 
and the latter the remaining r — j eigenvalues of A. 

In our case, for v sufficiently large, the disc T>l(v) does 
not intersect with the union {jf = i T>i(y), thus the smallest 
eigenvalue a\(v) lies in T>l(v) and the other L — 1 ones in 
UfcTi 1 T^iW)- Hence, for v sufficiently large, a\{y) < L+\pLv 
and on(v) > (£ - 1) + v - ^/v(VL + y/2(L - 1)) for 



i = 2, 



, L. Therefore, < 



< 



L + VLu 



and so lim^^^ ai(v) / a,i{v) = for i = 2, . . . , £ Hence, 

, N , S Oil W) 



OLi{v) 



1. 



while for 2 < i < L, 



q%{y) 

ol\{v) 



OO, V 



B. Proof of Proposition III.6\ 

Using ESWO) = EM ■ EX(i/), gives 



1 M 

— Y 



ES M (v) EMEX(v) EM f^EXi(u) 
Thus the Laplace transform of Sm/ESm may be written as 

= G M (CxiM/KXiW^^)) , 

where G M {z) = E{z M ). The assumption X{v)/EX(v) A Y 
implies that lim^oo C Xi {u)/%Xi{v){t) = £, Y (t) for all t > 
and i £ N. Hence 

Jim C Sm{v)/%Su {u)(s) = Jim G M (Cx^/mx^) (^)) 

= Gm ( Cy (e^m))' 

which is the Laplace transform of S»=i Invoking the 
continuity theorem for Laplace transforms concludes the proof. 



